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ABSTRACT 


This paper considers estimation of parameters for a 
model currently being used to study the Illinois Interlibrary Loan 
Network (ILLINET) and presents a method for determining how much data 
should be collected and how it should be aggregated. Four classes of 
model parameters discussed: demands, probabilities of success, 
processing times, and delivery times. This approach is being used as 
a means of deciding when to terminate data collection as well as 
determining the number of request classes to use in the policy 
analyses being performed with a library network model. This paper 
shows how model parameter uncertainty affects prediction uncertainty 
as a function of sample size and number of request classes. As sample 
size increases, uncertainty decreases. However, as the number of 
request classes increases, the sample size in each class decreases 
and thus, uncertainty increases. An appendix presents a new version 
of the model used in previous studies which could be applicable to 
many different types of networks. (DAG) 
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This report is based on a paper prepared for presentation at the 
1976 Joint National Meeting*of the Operations Research Society of America 


and The Institute of Management Sciences. The purpose of this summary 


is to discuss in general the mathematical derivations presented -in this 


report and thereby allow the reader to move immediately to the example 


and discussion in the later part of the report and avoid the detailed 


equations, 


The library network model which we have developed predicts 


probability of satisfying a request P, average time from initiation of 
: is 
a request until receipt of the desired item W, and average cost to satisfy 
a request C. To make these predictions, the model requires input data 
or parameters. These parameters are Scien using samples of data from the 
network of interest. (See discussion in Project Report No. 3.) If the 
size of the sample is small, then our parameter estimates will be poor 
and -we will be uncertain of the "truce" parameters. As the size.of the 
sample increases, uncertainty decreases. Uncertainty in the model 
. parameters will lead to uncertainty in the model's predictions. 
The primary purpose of this paper is to show how acdel parameter 
uncertainty affects prediction uncertainty as a function of sample size 
and number of request classes. As sample size increases, uncertainty 
decreases. However, as the number of request classes increases, the sample 
size in each class decreases and thus, uncertainty increases. The method 
_ deve Loped in this report can be used as a guide for choosing sample size 


and number of request classes. 


~ 
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¢ 
An example, utilizing demand data from Project Report No. 3, 


shows how the modified coefficient of variation (Lge o5-/x) s affected 


by sample size (% year) and number of request si ell For. a” given value 


<2 


of the modified coefficient of variation (say a), WB ca say thie’ “wage : 


ow nes 
95% confident that the true average performance is leis fe percent) 


omens 


of the estimated average performance. Chodsing a desired maximum ‘for “a 


defines the minimum sample size and/or maximum number of request ‘classes. 


mony 


In an appendix, a new derivation of the pode l inoprepented dl that 


~ requires fewer assumptions and allows” more robust representations” OF, Tibriity 
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INTRODUCTION e ae 
In an earlier paper, the development of a library network model 
was reported [1]. The model predicts network performance as it is affected 
by abeiant configuration in terms oF. operating policies and alternative 
technologies. Network noir foie nittiadvtined as having several components: 
probability of satisfying a caquane, average time from initiation of a 
request until receipt of the desired item, average unit and total operating 
Leswacd, os siabbige tied average request processing~loads-throughout~ the network: 


Since the wedet was developed in 1974-1975, it has been applied 


sored. 


to analyzing the L[llinois ltbtaey and Information Network (ILLINET) as 
a whole (2) assessing the impact of computer technology (e.8-5 OCLC) on 
interlibrary loan networks inugeneral and ILLINET in particular Ol, and 
analyzing one of ths vagtoual networks of ILLINET [4]. 

The purpose of this paper js twofold. First, an Apress or 
this paper presents a new version of the model that is more general and 
allows representation of many different types of networks. Second, and 
perhaps more important, this paper will consider estimation of the model 


parameters and present a method for determining how much data should be 


collected and how it should be aggregated. As the reader will see, para- 


meter estimation is quite straightforward while the question of appropriate 


sample sizes and levels of aggregation is more complicated. 


PARAMETER ESTIMATION 
There are four classes of model parameters: demands, probabilities 


of success, processing times, and delivery times. In this section, we will 


define these parameters, briefly “note how they can be estimated, and point 
out how appropriate confidence intervals can be determined. 

The average number of requests per unit time generated by 
requesting! Library k in request class j is denoted by Ain’ jf 22 asegT3 
k = 1,2,...,K. “A requesting library is a request-gathering organization 
that can deal directly with the network. Thus, requesting libraries need — 
not be "libraries" in the traditional sense. Request class can represent 
subject area, type of request (e.g., document or information), type of 
renieeect, ate. 

The eietied demand as defined by A se is estimated in t'e usual 
ere The general approach to determining confidence intervals for 
Aik can be utilized [5, Chapter 11]. : If we assume that request generation 
can be*modeled as a Poisson process, 3 very simple formulation is possible 
(6, 2 354]. 


the probabLlity that resource \library £ will satisfy a class J 


request at the Pig stage of referral is denoted by Pala J Ly2ye0 052s 


j 2 1,2,...,3; £= 1,2,...,L. This probability is estimated in the usual 

manner with the confidence interval being straightforward [7, pp. 216-219]. 
; The average time cor resource library i to successfully process 

a class j request at the gth stage of referral is denoted by ise while 

the average time~to~unsuccessfully process such a request is denoted by 

; Mayet These estimates and confidence intervals are calculated in the 

usual way. 


The average time for a class j item to be delivered from resource 


library i to requesting library k is denoted by t Again, the 


ijk* 


estimate and confidence interval can be found in a straightforward manner. 
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The above discussion of parameter and confidence interval esti- 
- oa eas ag quite terse, reflecting the fact that the appropriate procedures 


& . can Bf found in any of a number of undergraduate textbooks. .We will now 


a 


_ consider ow uncertainty in the model parameters leads to uncertainty in 


ei ae R ctions of the model. 


2° gh 
* Ve - 7 
* DATA, COLLECTION 
ae . % Our nietwork performance measures include* 
? ¢ 
; 
; J K 
P= £ £ Ap, /A (1) 
je ker Jk Jk 
% i ’ 
oF . 5 ‘ 
ie ie aa we P.,W.,/PA (2) 
ae ini Re  akP pM . 
dina 
re c 5 5 h /PA 3) 
a oe = =. 41,¢ 
oe es jet kat JES SR 
where 
wae ry xe 
eee gy Aw 5 ED (4) 
Fe jalteny J . 


and P ts the probability of the network satisfying a request, W is the 


average time from initiation of a request until the desired item is received © 


wantin © andy “ome * yaad Sate 


“by the Fewest ing brary, ¢ i “the average cost per satisfied request, 


t. itnd” wit Fie average total demand on the network. 
. ‘ 


\ 


- *the ‘equations utilized throughout this section represent the manipulation 
of parameter estimates as opposed to the "true" parameter values which 
“one usually cannot obtain. ‘To simplify notation, we have not annotated 


+ ours symbols in any way to show that they are estimates. 
7 * ° Ps -, 


\ teea” 


' 
The components of equations 1 through 4 are defined in the 
Appendix in terms of the model parameters discussed in the previous section 
of this paper. We now want to eoaubine how uncertainty in the model para- 
meters results in uncertainty in P, W, and C. It will be assumed that 
the estimated parameters are random variables drawn from appropriate 
sampling distributions. Further, the quotients of the sample variances 
and sample sizes will be utilized as estimates of the variances ss the 
sampling distributions. 
Observation of equations 1 through 4 and those in the Appendix — 


shows that we must frequently consider sums, products, and quotients of 


2 


random variables. Thus, let us review a little probability theory before 


“ye begin our analysis. 


If y = x) + Xo +... + X, where XpoXore eek are random variables, 


N N 


N 


E(y) = © E(x (5) 
i=l 


D> 


and if x are linearly independent, then {7, p. 108) 


pear o%y 


N 
o? a. 2 o 
isl %& 


where E(-) denotes the expected value while ae denotes the variance. 


Ify=x where KpeKove reek are linearly independent 


Xp t%y 
random variables, then’[7, p. 56] 


N 


N 
K(y) = mW E(x,) 
i=l 


10 


Fg 2 ue 2 
J os mlo +E(x,)°] - Cm E(x,))” . (8) 
eee, isl 4 i=l 


Division of random variables is much more complicated. For 
example, if y = 1/x, the moments of the distribution of y are not ‘definable 
in terms of the moments of the distribution of x [8, pp. 128-129]. Thus, = 
_the 1/P terms in equations 3 and 4 en potentially troublesome. 

' The. approach adopted in the following analysis is to look for 
sums and products of linearly independent random variables and avoid 
quotients of random variables. Not infrequently, we will Per random 


variables to be linearly independent when in fact a weak relationship 


actually exists. This approximation is necessary if the analysis is to 
be tractable. 


We will first consider the network probability of success as 


defined by equation 1. This equation can be rewritten as 


J 


K 
\ P= DY DR a Pp (9) 
ae y eg ed 4s ie EOE, 


where a,, =A /A is the proportion of class j' demand originating at - 


jee 9 
requesting library k. Assuming the probability of success to be independent 
of the request source, equations 7 and 8 can be utilized to determine 
E(P) and oe respectively. 

Assuming that the a requesting library generates n,. requests 
over some period of interest and that these are categorized into J classes 


of approximately equal size, then 


11 


To determine: the expected value and variance of Pin? we first 


note that (see Appendix) 
L 
Pie Pie (13) 
where Pit is determined from model parameter Pijle and a knowledge of 
the request routing policy employed. To avoid having to prescribe a parti- 
cular request routing policy, it will be assumed that Pals does not vary 
with i. In other words, it will be assumed that all resource libraries 


are identical. This assumption results in 


4-1 : 
Pig = P m (l-p,)_) . (4) 
pee” P5fe Palm 
where Pils is Piss with the i subscript dropped to reflect our assumption. 
Equations 5 through 8 can then be utilized to compute F(P),) 


and o? in terms of E(p |s) and o which are given by 
jk j Pils 


E(P5 |) = Pils (15) 


) P.| g(l-p, 1 9) 
ot = alle ile (16) 

34 ils 
where nile is the sample size upon which the estimate of Pils is based. 
If one assumes that requests are uniformly distributed among resource 
libraries at each stage of referral, then it is straightforward to show 
that 

N £-1 
J 


_— 1- ° 17) 
my]a'T an Pil (17)... 


4 


It would simplify matters substantially if one could choose 
values of Pile’ In many situations, p = 1/2 maximizes uncertainty and 
thus, we will ELE RS GP | 2 = 1/2 for all 7868 4. With this assumption 
and recognizing two truncated geometric series [6, p. 99) that appear 


upon combining equations, one obtains 


E(Py,) = 1 (1/2) e —8ig5 


i t 
o® «= £ (i/2)74 Cr Co ghek ay eu) 4 (19) 


Pik’ fel m=1 

Thus, equations 10, 11, 18 and 19 are sufficient for calculation 
of on. At this point, it is interesting to note that data collection 
involves two basic decisions; choosing J and N. In other words, one has 
to decide how much data to collect and how many request classes to utilize. 
Studying equations 10, 11, 18, and 19, one notes that increasing N always 
helps decrease a. 
as o* and o are effected, in opposite ways by changes in J. We will 


ik Pik 
return to..this point in the discussion of an example. 


However, the effect of J is not as straightforward 


10 


Now, we will consider W, the’ average m initiation of 
a request until the desired item is received by the requesting library, 
as defined by equation 2. To avoid the difficulties posed by the 1/P tem, 
we will rewrite this equation as 
J kK 


PWw= © f 
j=l k=l 


ak Pik Wik (20) 


where Fk retains its previous definition. Our approach here will be to 


determine E(PW) and oF and then utilize equations 7 and 8 to work 


‘backwards to obtain E(W) and a This is possible since P and W are 


linearly independent. (W.is a function of P (equation 2), but the 


relationship is not linear.) 


Utilizing equation Al0, we obtain 


From the Appendix, we note that MT) is defined in terms of Miya Wise 


and tik If we assume the expected values and variances of these model 
a ~ 2 
parameters to be E(w), E(w), E(t), a on and of respectively, then one 


obtains 


E(w., ,) = (4-1)E(%) + E(W) + E(t) 


jke 


% + L of] 
: ny (1-(1/2) ) 
where, as in the derivation of equations 18 and 19, the recognition of two 
truncated geometric series: considerably simplifies the resulting expressions. 
Knowing the moments of Yhkch and the previously defined moments 


of Pike and a kr one can use the properties. of sums and products of linearly 


j 
.independent random variables, to determine E(W) and a. Since it will 
later be of use, we should also note that quite similar manipulations 

2 
z 
will yield E(w.) and cae 
Now, we will consider C, the average cost per satisfied request, 
as defined by equation 3. In a manner similar to that with which we 
approached W, this equation will be rewritten as 
JK 


m= £ fa 


jel ket JE SK “Gk 


As before, our approach is to determine E(PC) and re and then work 
backwards to obtain E(C) and on ‘* 


Utilizing equations in the Appendix, we will define 


= = rl : om c 
dig ™ Pye Cpe 7 Pyle + Pyyle - CLUDE] + Py, es, 


(27) 


with Cand ¢ being the cost of satisfying and not satisfying, respectively, 


a request at any particulier resource library in the network. 


22. 


ajeoaimsnerer,——_fnkyeteataneinwe Ak Sa Nn hse 


x 
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‘the moments of e,, can be determined using equations 5 and 6 


jk 


while the moments of d,, are somewhat more complicated requiring fourth 


jk 
moments [8, p. 162] of what will be assumed to be a normal sampling distri- 


bution. Combining the moments of d jk? and our previous results, E(C) 


_ 
and on can be determined. Also of use are the moments of at which are 
straight forward and given by 


E(e 5) = Le + [e - (Lt1)¢] E(P..) + B(e 5.) (28) 
of = (8 - (mee ot +02 (29) 
“jk Pjk “5k 
2 2 2 : 
We now can compute op» oy and eo, as a function of the uncertainty 


in our model parameters. Assuming that the sampling distributions of 
P, W, and C are normal, we can determine the 95% confidence intervals for 
each performance measure using + 1.96 o. In other words, we can acinewtin 
an interval such that we are 95% confident that the true average parfor- 
mance measure lies in that interval, 

To further refine our measure of uncertainty, we will define 
a modified coefficient of variation as 1.96 o/R(+). Suppose this 
coefficient has some value x. Then, we could say that we are 95% confident 
that the true average performance is within 100 x percent of the 
skbinatad ‘slaves performance. 
AN EXAMPLE 

In reference 2, we discuss an analysis of LLLINET based on a 
sample collected in February 1975. The sampling scheme resulted in accumu- - 


lation of one day's demand_from each of 19 requesting. libraries._..In.terms. Riemer 


16 


13 


of one year (250 working days), this was a 0.4% sample. This sample gives 


us estimates of ns k = ee for use in the procedure presented 
in this paper. The other input necessary to the procedure includes 
E(w), E(w), E(t), a, oh ¢, and ¢. For these variables, we will choose 
realistic but arbitrary values of 5,5,5,5,5,l, and l. 

The modified coefficients of variation for P, W, and C are shown 
in Table I as a function of sample size (N) and number of request classes 
(J). Note that.uncertainty is fairly sensitive to N but almost insensitive 
to J. We discussed the reason for this result in our derivation of oe 
Even froma purely intuitive poine of view, this result is not surprising 
since it would seem that our aggregate performance measures should eat be 
too sensitive to disaggregation of component measures. 

Using Table I, we can reach conclusions such as: At least a 
2% whnpiads days) is necessary to maintain uncertainty-under 10% for all’ 
performance measures. A law of diminishing returns is evident even in 
the rounded-off entries of this Table. For example, a 4% sample does not 
yield one half the uncertainty of a 2% sample. 

To consider uncertainty on a less aggregated level, the modified 


coefficients of variation for Pik? Wine and c,, are shown in Table I1.* 


j jk 

Recall that j refers to request class while k refers to requesting library. 

We will call performance at this level type jk performance. As one would 

probably expect, uncertainty in type jk performance measures is quite 

sensitive to J. u 


Considering the results in Tables I and II, one might waht to 


choose different accuracy goals at each level. For example, 5% uncertainty 


was calculated using the average ny soo. ; 
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*the coefficient for w. 
ans ppllncaireas sepipasaisaessbenienalovevanencqvaceaeesnose=apuaheal fe 


SAMPLE ' NUMBER OF REQUEST CLASSES 


$1ZE " res t 
(xvEAR) 4 e> 3 a 5 t. ? ) 9 10 

@.4 P 0,7 0,07 O.A7 A,A7 GeA7 O,A7 0,07 8,07 0,08 2,08 

Wo M2 B02 0,02 0,03 4.03 0,03 0,03 0,03 0.93 2,03 

C 4,20 U.2@ M,2t 0.2) 21 0,22 G22 4,22 A,23 0,23 


0,6 P 8.25 0,05 0.95 0,05 8,05 O,45 0,95 0,05 0,05 2,¢5 
W 8,42 ane @,02° 0,92 4,02 2,22 0,02 8,02 0.02 @,92 ~--- 
C 8,14 O,19 0.14. 0,19 4,19 8.135 8,15 9.35 0.15 2,15 


O40 8.02 7.94 8,494 0,204 4,08 0,324 38.94 A,.04 8,44 
DeMl1 6.01 A,01 8,41 H,01 3,01 0,51 2,0! 
1% O11 0,12 O12 A,12 8,12 A,12 4,12 A132 8,12 


az v 
s 
s 
— = 
—_ 
s 
es 
2 
- 


1.6 P 0.24 Bend 0.94 U,04 4,09 8,08 3,04 8.09 6,24 0,00 
WH O,At A,Ut B61 O,01 .01 Bat OAL BeAt OeOL BM 
C A102 8,10 9.1% B,18 4,18 G,1A B10 A,10 0,10 2,10 


2.0 P 4.93 O.n3 0.03 0,03 0.03 0,03 0,03 4,03 0,03 20,03 
W O,@1 @.01 A.A! @.0L @.01 O03 O01 O01 O,a1 2,01 ° 
C 0,49 4.09 9,09 @.09 @.09 0.99 0.09 @,.09 @,09 2,09 

2.4 P 43 4,93 4.43 4.03 0,03: 2,03 0,03 4,03 0.03 2,93 
W OO O01 01 A,01 @eAl @,@1 @.01 O01 O,01 2,01 
C 8,78 9.08 0.08 0,08 0,08 0,08 0,98 0.08 2,08 2,08 

2.8 0,23 0,03 @,93 0,03 0,03 4,03 0,03 0,03 0.03 0,03 


@,.A1 6,0) G,0f A,01 4,01 8,01 0,231 6,21 
0,47 0,07 A,07 0,87 6,07 A@,08 0,28 0.08 6.08 0,08 


oOzv 

Ss 

e 

Ss 
- 
3 

e 

i] 
- 


3,2 P 9.93 8,03 9,H3 0,93 4,23 8,43 0,03 4,03 0,03 a,93 
: w 9,41 G41 O,01 4,01 1,03 0,91 G81 9,04 @,81 8,01 
C 2,97 A.A? O,07 8.27 8,87 A.A? A,07 B.A? 8,97 Cu? 


@,02 0,02 4,42 4,02 ¢,02 0,02 0,42 4,02 @,h2 4,02 
O.°7 8.07 8,97 O,F7 G87 4,97 4,97 O,A7 O47? 8,07 


Ooze 
S 
s 
= 
-_- 


4,0 P O02 0,02 Af? 8,02 8,02 8,22 4,22 4,02 8,02 1,02 
‘ " 0,01 O,A1 A.A' O.91 8,03 O41 8,H1 A,AL 8,81 08,a1 
C 0.66 8,06 A,.76 4,06 4.896 0,096 8,06 0,06 4,86 0,06 


FABLE IS Modiffed Coefficients of Variation for Network Performance 
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SAMPLE ; NUMBER OF REQUEST CLASSES 
SIZE ; 
| CXYEAR) 1 2 3 4 5 6 7 a 9 1a 


CuK 0,16 

4.2 PJK 9,46 4,99 O.11. O33 4.14 O,16 G,17 4,38 0,19 O,20 
WJIK 8,095 2,07 8,09 ALM A132 O.13 0.149 A.15 G.16 8,17 
CJK 4,15 9,21 @,25 4,29 9,33 4,36 8,39 0,42 0,94 @,47 


1.6 PJK @,96 0,98 Me1% ell y12 0,14 @elS 16 0617 0.18 


2.4 PJK O.AS 0,07 4.09 0,10 O,11 Bel2 M613 8.14 O.15 0,16 
WJK 8,04 0,06 O07 3.08 BAY BIA Ail Bell O,12 0,13 
CJK 8.11 @,16 @.2% 8,23 0,25 0,28 .39 U,32 0,34 0,36 


2.4 PJK O95 ¢.%6 O38 O,A9 A,19 A111 MiP B13 0,14 8,14 
WJIK 4,00 6,05 0.96 4.97 9,08 8,09 6,14 08.1% 4.11, O12 
\ CJK A190 A415 9,15 B2t G,23 0.25 ,27 8,29 8,31 U,33 


2.8 PJK 9,46 0.06 @,.97 0,08 0,09 0,10 8,11 O12 A, 
WJIK 2,23 9,05 0.96 @,07 UA 8,08 AQAF UelM O1H O11 
CUK 6.10 6,14 M017 0.19 @,21 0,23 0,25 0,27 @ 


3,2 PK O.44 0,06 0.97 G.98 4.79 A,1% 1,18 M1 O.12 A,12 
WJK 0,03 7,04 0,06 9,06 8.07 @,08 4,08 4,09 4,10 O,14a 
CJK. 8,09 13 A,1h A,18 0,238 8,22 4,24 4,25 8,27 8,28 


' 3,6 PJK 8,48 4,05 9.06 ,A7 A,AB 3.49 AIA 8,19 A 11 A118 
WJIK A433 ‘4.94 9,025 8,06 4,47 0,97 4,68. 409 4,49 0,10 
(CUK A,98 Ayt2. He15 M17 9.19 M2) M22 M,24 0,25 0,27 


WJIK 8.43 0,99 0,95 @,46 8,96 0,97. 0,08 0,08 4,09 8,09 
CJR G,78 O11 O.34 O16 0,18 9,28 &.2i 8,23 8,24 6,25 


a tae ~~TABLE TI: Modified Coefficients of Variation fot Type jk Performance eae Fs 
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; wen 
in network performance might be used with 10% uncertainty "ia type jk 


ry 
performance. One might then choose N so as to achieve. und Sh yoal for 
‘ ‘g 2 
network performance. Tf the 10% goal for type jk performatice is not achieved 


Bie the weciehawaadies value of N, then J might. be decreased; Qf course, this 


approach is only one of a variety of possible method's "vow ehosbine N and J. ~ 


a] 
G- 
at 


: “ 8% 
CONCLUS LONS <3 


nevertheless very important. In an attempt to deal with th 


The problem considered in this paper is day Seine but 
L chic 


i 


we have resorted to various approximations. The Fegult ij what we feel 


-* 
to be a reasonable method for deciding how much data, to det and how 


nr 4 we 


the data should be aggregated. pee eae 
The approach developed here is currently ‘being used as a means 

of deciding when to terminate mite collection in thorheudy of a regioal 

network within Illinois [4]. etek it is being sed deeiue’ por ° ia 

the number of request classes to use in, the policy Ungly eine performed 


with our library network model.  @. 9’ 
Another interesting potential application af sey atid praposed 


here is in the study of time variations in average networkyger formance. 


The sample size considerations resulting from analyses aunthe Ag sleveloped 


‘ 


. 
here put constraints on the type of time variations which can *be studied. 


we a eek : %, roe 


For example, if your accuracy requirements dictate acrtog srtle 5 tien ong | 


cannot study weekly variations of average demand unless, off firted Sone is 
a 8 ie : 


. 
im 


willing to space his sampling out over the whole P, : aa 
+ | 


mneicacane 
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Another data collection issue concerns how long one should sample 
to obtain a certain sample size. Sie exangle: one may have to collect 
data for 15% of the year to actually accumulate complete data on any parti- 
cular period equal to 10% of the year. This is due to the fact that requests 
stay in a network for a period of time: A simple model which is of use 


in answering this question is discussed in reference 4. 


The accuracy of the output of mathematical models is, for the 


, 7 
most part, dictated by the accuracy of the input data or parameters. 


The purpose of this paper has been to-suggest a method of dealing with 


this problem. 
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oe? 3 APPENDIX 

ue 2" ‘ ‘ Since the publication of the paper in which our library network 


7%, 
model was derived, we have developed a new version of the model that 


requires fewer assumptions and allows more robust representations of library 
Shy , networks. In this appendix, we will present the derivation of this new 


Lath vérsion’of the model. 


o a a ee 2 Assume that we have K requesting Libraries who demand resources 
a er 

ad 7 ip 2 request classes. We will define Mik} GS Ly2ysanpde Re 12050, K3 
‘aa * "Sas re average demand (per unit time) in class j from source k.- This will 
= ee : ee termed type jk demand, ‘ i 

ae 7 -. Type jk demand will be routed through the network using a routing 
: ee vector with elements Tike! L= 1,2, +s 9lays where the ayth element of the 
es 3 2. reiting vector is ae designation of the resource Library to which the 

. Ua “A epijuayt will be eer if it has not been satisfied at the 4-1 resource 

sy eich specified ‘bythe first t-1 elements of: the routing vector. 

ig " am Given. the demand data and ‘routing vectors (or. policies), we would 
‘hd oa : ‘Ake to determine My 7) which is the average type jk demand (per unit time) 


2 


ay Mee ae On, cfgsaree Library i; i= 1,2,...,1; at routing stage £. This will be 
“ & 


# 


ae « tgrnefr Womand. 

Ps va 

‘ 7 : To compute Mike we must determine the probability that a type 
rae ‘ Py 

re aik sie a will bé satisfied at its 2° routing stage. (This probability 

vege? . will be denoted by p, kat -) For this to occur, the request must pass 

OF 

ia ' fae unsagigf ied through its previous £-1 routing ag i Thus ,* 


v as Pikes = p(r kg? ily % 1 fs “PC jy j|m)) (Al) 


Subscripts are sometimes bracketed for clarity. However, the meaning 
: loes _not change, For example, Bale —pldyj J.4)- -are-equivalent..—-Alsoj~ 
is 3 d-1 rN 
a nats that for fl, Al, ¢ - ) : 1.0 and = -( . ) = 0.0. 


* 4 m= ml 
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% 
egw 


where p(i,j]2) is the probability that resource library i will satisfy 

a type j2 request. Note that we have assumed p(i, jl 2) to depend on the 
number of resource libraries that have processed che ceaueat previously. 
This reflects the possibility of requests with large 2 being "difficule" 
in the sense that £-1 resource libraries have been unable to satisfy them. 
While we might allow p(i, jl 4) to depend on the specific £-1 resource 
libraries who have unsuccessfully processed the request, this would pose 
enormous data collection problems in that parameters for numerqus unlikely 
routing vectors would have to be evaluated. 


Resource library i.can be described as a subnetwork of My subnodes 


with transitions between the subnodes described by transition probability 


matrix P Defining ) j as an M, vector which represents the average 


ijk ijZ i 


network demand directly entering the Me subnodes ,” then 


af 
= iS ' 
Sug" hl ae Bigs Mag) 


where I is an My x My identity matrix, (*)' denotes the transpose of a 


matrix, ay denotes the inverse of a matrix, and Nya is an Mi vector 


whose ae element represents the average type j£ demand on the uf 


subnode of resource library i. 


Lf we agama” that By is independent of de then the fraction 


jZ jv’ 
of Ses reaching each subnode of resource library i is independent of 


ew 


Since subnetworks can be arbitrarily defined, we can with no loss in 


generality assume that the Lirst element of Asya 
element. This greatly aids in our computation of p(i, jl). 


" 


is its only non-sere 


£ eiys is related to Mie then these matrices will be interrelated 


among libraries and lead to our needing a simultaneous solution to a 


large number of nonlinear matrix equations. 
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iN - Thus, we can compute A for an arbitrary. 4 and the tract ton 


“ijl ijf " ips 
of Bay t reaching the subnode which represents request satisfaction is 
p(i,j |). 

‘ » Given p(i,j 2), we can use equation Al to compute Pike and then 
compute Mi jkee as follows. Let dike = 0 for all i,j,k, and &£.. Then, 
recursively use 

£-1 
(ei pdrks #) = Merged ay £) i Bins (1- ~P jem rik (A3) 
for all j,k, and 2. ‘this now allows us to compute 
K 
i = Z \ (A4) 


and thereby define the first and only non-zero element of Arye Equation 
A2 can then be employed to compute Nye 

To estimate the average time required for’a request to pass 
through*resource library i, we first must estimate the average time required 
for a request to pass through each subnode of resource library i. From a 
; queue ing perspective, this requires an estimate of the total average 
‘demand on each subnode which is given by™ 

a | 
“i es ne Aue ‘ail 
Defining Wi as ane; vector represent ing the average times to pass through 
the subnodes of. resource library i, then 
Wi = fF (Ais other parameters) (Ab) 


i. = Max (ty, or, more simply, L=l. ; _ 
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where F(*) is a matrix function reflecting the particular queueing model 


chosen to represent subnode processing. Defining Ly as an My vector whose 


» then 


atl element is the product of the at elements of Ay i 


L and w 


yr hey, (A7) 


ae ee rey Bie 


Lijs ijd 


where L is an M, vector whose me? element is the product of the as 


~ijh i 
elements of Mia and Wiys where Wiga is an M, vector representing the 


cumulative average time for a type j& request to pass through each of the 


subnodes of resource library i. Since A is known, we can solve for 


£ 


Miya 


Without loss of generality, we can assume that a stuyle subnode 
of resource library i represents the departure point for satisfied requests 
while another single subnode represents the departure point for unsatisfied 
requests. Now, define Wiig as the average time Co successfully pass Chrough 


the success subnode while W is the average time to pass through that 


ijf 


subnode which is the departure point for unsatisfied requests. These two 


variables are distinct elements of Wie 


Now, let w., , be the cumulative average time from initiation 


jk 
of a type jk request through its satisfaction at the ie stage of referral 


and the delivery of the desired document or infomation to the reques thay 


library. This variable is calculated using 


£-1 
» by j bw ye 2) + U(r, gjik AB 
Wika a WO ig gd ot wT ig j, 2) (i ged ) (A8) 
where ty is the averaye time for a class jf item bi be delivered trom 
; ik 


resource library i to requesting Library k. 


With one modification, we can calcahite average process fy costs 
in the same manner as average processing times by simply substituting costs 


for times in equations A7 and A8. Thus, using c, in equation A7 will lead 


i 


and ¢,,,) can be used in 


ijt jg WAS 
equation A8. We have to employ A& twice: ‘once to calculate the cumulative 


to computation of C whose elements (¢ 


cost of success rw) and once to.calculate the cumulative cost of failure 


Sich (by using Cis = Ci 


is only calculated for % = bik and reflects the fact that the overall cost 
nod 


of operating the network includes not only the investment if satisfied 


and zero delivery cost). The cost of failure 


requests but also the funds invested in requests that were net satistied. 
We can now calculate the performance experienced by a type jk 


request. The probability that a type jk request is satisfied is given by 


(A9) 


while the average time required from initiation of the request until 


receipt of the desired item is 


L 


i‘ pa Pica “jhe! Pix 


and the average cost of satisfying a type jk request is given by 
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The performance measures p k? Wik? and-c,, can be summed across 


j jk 
j and/or k (weighteu by diy? to yield additional aggregate performance 
measures. 

This new version of the library network model has been progranmed 


in approximately 175 FORTRAN statements and allows for arbitrary network 


and subnetwork contiguratious. 


